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A measurement of vector boson (y) production in conjunction with a D*(2010)^ meson is presented. 
Using a data sample corresponding to 9.7fb“^ of proton-antiproton collisions at center-of-mass en¬ 
ergy ^/s = 1.96 TeV produced by the Fermilab Tevatron, we reconstruct V + samples with the 
CDF II detector. The is fully reconstructed in the D*(2010)^ ^ decay mode. 

This technique is sensitive to the associated production of vector boson plus charm or bottom mesons. 
We measure the ratio of production cross sections a{W-\-D*)/a{W) = [1.75±0.13(stat)±0.09(syst)]% 
and a{Z + D'^)la{Z) = [1.5 ± 0.4(stat) ± 0.2(syst)]% and perform a differential measurement of 
dcfiW + D*)/ dpT{D*). Event properties are utilized to determine the fraction of U + D*(2010)^ 
events originating from different production processes. The results are in agreement with the predic¬ 
tions obtained with the PYTHIA program, limiting possible contribution from non-standard-model 
physics processes. 

PACS numbers: 13.85.Ni,14.40.Lb,12.38.Qk,14.70.Fm,14.70.Hp 


In high-energy proton-antiproton {pp) collisions, pro¬ 
duction of a vector boson, U, either a or Z^ boson, 
can occur in conjunction with one or more heavy quarks, 
Q, where Q = c,b. Standard model processes that con¬ 
tribute to U + Q final states include direct (nonresonant) 
production as well as U + Q final states originating from 
decays of top quarks and Higgs bosons. In addition, high 
mass, non-standard-model states could also contribute, 
and therefore a detailed study of V F Q processes could 
be sensitive to these contributions. 

In the standard model, production and decay ofVpQ 
states is governed by a combination of the strong and 
weak interactions. For example, the pp ^ W F c process 
is sensitive to the magnitude of the Cabibbo-Kobayashi- 
Maskawa matrix element Vcs as well as the strange quark 
distribution function of the proton mn]. Measurements 
of this process, along with the related processes VFbb and 
V F cc, can be used in conjunction with our knowledge of 
the weak interaction in order to test our understanding of 
the strong interaction at high energy [3]. Study of these 
processes also helps to provide better modeling of the 
backgrounds relevant to non-standard-model searches at 
the Tevatron and LHC. 

Previous Tevatron analyses of pp ^ V F c produc¬ 
tion have measured the absolute and relative cross sec¬ 
tions cr(V + c)/a{V + jets) [4]-[7]. Restrictions on jet 
acceptance along with the heavy flavor identification 
techniques limited these analyses to high-momentum 
(pT(jet) > 20 GeV/c) charm and bottom hadrons. In 
addition, systematic uncertainties arose in the inclusive 


tagging techniques used to separate light-quark, gluon, 
and heavy-quark contributions. Similar measurements 
have been carried out at the LHC [HHin]. 

Here, we present a complementary technique in which 
the charm decay is fully reconstructed as a L)*(2010) + 
meson (denoted as I)*+) through the decay chain ^ 
followed by K~7t^ [TT]. The subscript 

on TTs is used to denote a “soft” pion with low aver¬ 
age momentum compared to the other pion arising from 
the decay. This full charm reconstruction provides 
additional information (charge correlation between de¬ 
cay products, impact parameter) to further classify sig¬ 
nal contributions. With this identification technique, an 
improved measurement range of transverse momentum 
> 3GeV/c is achieved, with an average value 
of lOGeV/c. In addition to U + c, the sample identified 
with this technique contains contributions from V F cc 
and charm from the sequential decay of the V F bb pro¬ 
cess. Full reconstruction of charmed final states in W 
boson events has been carried out at the LHC [9]. 

In reconstructing low-momentum charm hadrons in 
vector boson events, this work isolates a data sample 
that has been previously unexplored. To date, anoma¬ 
lous charm hadron production in pp collisions, particu¬ 
larly at low Pt{c), has not been ruled out. By isolating 
and quantifying a unique set of U + c events, this analysis 
constitutes a new test of the standard model. 

This work utilizes the full data set collected by the 
CDF H detector at the Fermilab Tevatron collider, with 
an integrated luminosity of 9.7 fb“^ for pp collisions 
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FIG. 1: Distribution of dielectron (left) and dimuon (right) invariant mass. The fit to the data is described in the text. 


at ^/s = 1.96 TeV. The CDF II detector [12] is a 
general-purpose magnetic spectrometer surrounded by a 
projective-tower calorimeter and a muon detector. The 
CDF II central tracking volume consists of a silicon detec¬ 
tor surrounded by a large open-cell drift chamber. The 
entire tracking volume is contained within a uniform axial 
magnetic field parallel to the proton beam direction [T3| . 

Events are selected with a three-level online event se¬ 
lection system (trigger). This analysis considers events 
selected by an inclusive high-p^ lepton trigger requir¬ 
ing an electron (muon) with Et > 18 GeV {pt > 
18 GeV/c) US]. From this high-p^ lepton data set, we 
select vector-boson events as described below. 

A candidate is required to have two oppositely- 
charged, isolated electrons (muons), each with Et > 
25 GeV (pt > 20 GeV/c) and \r]\ < 1.1. A lepton is 
considered isolated if / < 0.1, where / is the isolation, 
defined as ratio of the total transverse energy in a cone 
of radius AR = ^+ Ar]‘^ = 0.4 about the lepton 
track (excluding the lepton) with respect to Et{pt) of 
the electron (muon). The invariant mass of the lep¬ 
ton pair is required to fall between 66 and 

116GeV/c^. Background in the candidate sample is 
estimated by fitting the invariant mass distribution to a 
double-Gaussian signal plus an exponential background 
hypothesis. The number of signal events is obtained 
by integrating the double-Gaussian distribution, yield¬ 
ing 241450 ± 580 (257 580 ± 550) ^ ee{pp) events, 

where the uncertainty is statistical only. The dilepton 
data and fits are shown in Fig. 

A candidate W~^ event is required to have missing 
transverse energy ^t > 25(20) GeV [T4|, one isolated 
electron (muon) with Et > 25 GeV (pt > 20 GeV/c) 
and pseudorapidity \r]\ < 1.1. Missing transverse energy 
is corrected for muons and other instrumental effects that 
may produce false Finally, the transverse mass of the 
IF, Mt = 2Et^t{^ — cos A(/)), where Et is the trans¬ 

verse energy of the lepton and A0 is the azimuthal angle 


TABLE I: Boundaries used to define regions A, B, G, and 
W for W candidates in the ^t-I plane. Regions A, B, and 
G consist primarily of multijet background, and are used to 
estimate the multijet background content of signal region W. 
Numbers in parentheses are for pEv when different 

from the boundaries. 


Region 

range 

I range 

A 

< 10 GeV 

< 0.1 

B 

< 10 GeV 

> 0.3 

G 

> 25(20) GeV 

> 0.3 

W 

> 25(20) GeV 

< 0.1 


between the lepton and missing transverse energy, must 
satisfy Mt > 20 GeV/c^. 

Background to the W candidates consists of elec- 
troweak sources {Z £~^£~ and W ^ ru events, de¬ 
noted as EWK hereafter) and from hadrons displaying a 
similar topology as W events (multijet background). We 
estimate the number of all such background events by 
first relaxing the selection criteria on lepton I and event 
and then defining four regions A, B, C, and W in the 
^tII plane (Table [^. Region W is the signal region as 
defined earlier, while the other regions contain primarily 
multijet events mimicking the W signature. Lepton iso¬ 
lation versus missing transverse energy is shown in Fig.[^ 

We estimate x where yf^ is 

the number of multijet events in region V, as done in 
Ref. [15]. The contamination arising from EWK and 
residual signal events in regions A, B, and C is estimated 
using the pythia 6.2 Monte Carlo simulation [16]. The 
simulation is used to predict the backgrounds relative to 
the yield of LF £ 1 ^ events in signal region W. By relat¬ 
ing other contributions to the number of signal events, 
we produce a system of linear equations to be solved for 
the number of LF £v signal events in region W. The 
solutions determine the contribution of each background 
in each region. Of 5 081938 (5 348 975) candidate events 
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FIG. 2: Lepton isolation versus missing transverse energy for electron (left) and muon (right) events passing the selection 
criteria described in the text. The regions identified in Tableware shown graphically on the plots. The W signal is dominant 
in the region I < 0.05 and 20 < < 50GeV/c. 


in region W, 93.6 (91.5)% are W eu^jiu) signal. Un¬ 
certainties in these estimates are considered later in the 
determination of fractions N{W + D''^)/N{W). 

For all W and Z candidate events, we search for 
mesons by considering all reconstructed charged parti¬ 
cles (tracks) within 2.0 cm longitudinally of the point 
of closest approach of the high-p^ lepton to the beam¬ 
line {\Az\ <2.0 cm). For each possible set of three tracks 
with unit total net charge, we hypothesize a match to the 
^ iF“7r+)7r+ decay mode and test for consis¬ 

tency. To ensure well-measured tracks, each track must 
satisfy \r]\ < 1.1, and Pt{K) and Pt{7t) > 400 MeV/c, 
Pt (tTs ) > 80 MeV/ c. We also require AR < 1.1 for each 
pair of tracks. As is CKM-favored ver¬ 
sus K^7t~ by a factor of 10^, the two pions are 

required to have the same charge. 

The resulting sample is subjected to a kinematic fit to 
reconstruct and vertices from the iF, tt and tt^ 
track candidates. The K and tt candidates must inter¬ 
sect to form a candidate vertex, and the candi¬ 
date must intersect with the tt^ to form a vertex. 

Since direct charm {e.g., pp LFc, pp Wcc) as well 
as indirect charm {e.g., Wbb with b ^ c) are of inter¬ 
est, the trajectory of the reconstructed is not re¬ 
quired to point back to the primary pp vertex. Finally, 
we require the mass, determined by the fit, to fall 
within 3cr = 0.033 GeV/c^ of the known mass value, 
moo = 1.865 GeV/c^ [TT]. 

To reduce background in the selection, we train 
an artificial neural network (ANN) to discriminate among 
candidates using 19 variables as described in chap¬ 
ter 5 of Ref. [18]. These variables include charged- 


particle momenta and opening angles; the distance be¬ 
tween the pp collision vertex and the reconstructed 
and vertices; and track impact parameters. To 
train the ANN, we provide samples of signal and back¬ 
ground events. For signal, we generate a sample of 
W + events using pythia 6.2 m followed by a 
full detector simulation. For ANN training, we define 
the signal sample as all simulated W + candidates 
with mass difference. Am = m{K7r7rs) — m{K7r), within 
3<j = 0.0029 GeV/c^ of the known mass-difference value 
Am{D*~^ — D^) = 0.1455 GeV/c^ [TT]. To model back¬ 
ground for ANN training, we choose candidates in 
data that pass all fitting and kinematic selection criteria, 
but whose final-state particles do not have the proper 
sign correlations {e.g., K~7r~7rf). To model background 
events as accurately as possible, we require these back¬ 
ground candidates to fall within 3.2cr of the peak, 
a compromise that allows an equal number of signal and 
background events for optimal ANN training. Control 
data samples as well as simulated samples are utilized to 
verify that the ANN is well behaved and unbiased [18]. 

We map ANN output scores to the region [—1.0,1.0], 
with more positive numbers being more signal-like. Can¬ 
didates are required to have an ANN score greater than 
0.0 to be classified as signal. This suppresses 80% of the 
background while retaining 90% of the signal. 

With this algorithm, we search the selected vector- 
boson-candidate events for candidates. To measure 
the yield, we fit the Am distribution to a power-law 
background plus double-Gaussian signal hypothesis. We 
use a template for the double-Gaussian signal, taken from 
a fit to simulated signal events, with the signal width in- 
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creased by a factor of 1.1. This accounts for the slightly 
better mass resolution observed in the simulation [18] . 

We obtain the following yields in the W candidate sam¬ 
ple: A^(lT(ez/)+T)*+) = 340±30andA^(lT(/iz/)+T)*+) = 
294 ± 26, where the uncertainties are statistical only. In 
the Z candidate sample, we observe N{Z{ee) + = 

22 ± 9 and N{Z{/j./j.) + = 20 ± 7 events. When 

the electron and muon decay channels are combined, 
the yields are N{W{jj^ujeu) + = 634 ± 39 and 

N{Z(^fifijee) + = 42 ± 11. The mass-difference 

distributions are shown in Fig. Yields reported here 
are the number of reconstructed in events passing V 
candidate selection criteria, and therefore represent the 
number of reconstructed produced in conjunction 
with both signal and background V candidates. In the 
following, we account for the background V contribution 
in V + candidate events. 

For Z candidates, we define two regions in mass: 
The signal region is defined as — 91 GeV/c^| < 

3(7, and the background region is defined as — 

91 GeV/c^l > 3(7, with a = 2.0 (3.0) GeV/c^ for Z 
lili{ee) bounded by the 66 < m£+£- < 116 GeV/c^ mass 
window. We fit the mz distribution to a double-Gaussian 
signal plus exponential background hypothesis, and inte¬ 
grate beneath the curves to estimate the signal and back¬ 
ground yields in each region. For each region, we fit the 
Am for all candidates using our previously defined 
signal plus background fitting function. This provides 
yields in the signal and background regions. Two 
coupled linear equations are used to solve for the rates 
at which signal (background) Z candidates 
are produced with mesons as described in chapter 6 
of Ref. HH] . 

After measuring these rates, the procedure is repeated 
for various choices of signal and background m{£~^£~) 
boundaries. We fit the set of all values found us¬ 

ing these definitions to a constant value hypothesis, and 
take the variation in this fit to be the systematic uncer¬ 
tainty due to the unknown rate of mesons produced 
in association with V backgrounds. This is done sepa¬ 
rately for Z ^ e+e“ and decay channels, giving a 

systematic uncertainty in of 20% for the Z ^ fifi 

channel, and of 11% for the Z ^ ee channel. 

To determine the number of real V plus real 
events, we follow a procedure similar to that outlined 
above to determine the background contributions to the 
W sample. We split candidate events into ^t/£ regions 
A, B, G, and W, as defined in Table [l| Each region con¬ 
tains mesons produced with signal VF, as well as 

produced with three background sources: W rz/, 
Z ^ £~^£~^ and multijet events. The linear equations 
are solved simultaneously for the rates (/^;+^^) 

at which signal (background) W candidates are produced 
with mesons (see Section 6.3 of Ref. (Hj.) The asso¬ 
ciated systematic uncertainty is determined by repeating 
the procedure for several definitions of regions A, B, G 


and W, first keeping the I boundaries fixed and varying 
the boundaries, and then keeping the boundaries 
fixed and varying the / boundaries. This is done sepa¬ 
rately for the electron and muon W decay channels. The 
resulting systematic uncertainty in /^+ is 10% in the 
W ^ eu channel, and 2% in the W ^ jiv channel. The 
larger uncertainty for the W (^ eu) case is due to a larger 
fraction of multijet background, relative to the W ^ liv 
case. 

The full results of the sample composition studies are 
reported in chapter 6 of Ref. m- We observe the rate 
of mesons produced in events with falsely recon¬ 
structed V bosons to be considerably higher than the 
rate of mesons observed in real V events. This 

arises through Quantum Ghromodynamic (QGD) pro¬ 
cesses producing falsely reconstructed leptons. 

We unfold tagged signal fractions, , to ratios 

cr{V + T>*+)/(7(R), where V = (W, Z), by following the 
prescription 

a{V + D*+) _ _ fU(A-e)v _ 

a{V) (A ■ e)v+D^+B{D*+ ^ K7r)7r,) 

( 1 ) 

Here, the branching ratio S(T)*+ ^ K'k)'Ks) is 

0.0263 ± 0.0004 [17], and {A • e)y, {A • e)y+^>H+ is our 
total (geometric plus kinematic) acceptance for V and 
E -f- events, respectively. 

To determine the V acceptances, we apply the WjZ 
tagging algorithms over inclusive simulated samples gen¬ 
erated using PYTHIA 6.2. For the V + acceptances, 
we determine the values of {A • e) both differentially as a 
function of and for the inclusive set of all events 

with pt( 7^*^) > 3 GeV, the threshold below which the 
analysis acceptance vanishes. In the differential case, 
bins are chosen such that the expected number of tagged 
candidates in each bin is approximately constant ac¬ 
cording to simulation. Bins are as shown in Fig. refer 
to chapter 8 of Ref. [18] for specific values. 

Systematic uncertainties on the production cross sec¬ 
tions arise from sample yield and purity estimates, accep¬ 
tance, efficiency, and simulation [18]. Sources of uncer¬ 
tainty that are common in measurements of total cross 
section (e.^., luminosity and trigger efficiency uncertain¬ 
ties), cancel when taking the ratio cr(V + 

The systematic uncertainty in {A • e) is dominated by 
the uncertainty in the GTEQ m parton-distribution 
function (PDE) of the proton chosen for this analysis. 
Uncertainties due to the PDEs are estimated using 90% 
confidence-level (G.L.) variations on the 20 GTEQ eigen¬ 
vectors. To investigate possible further dependencies on 
the chosen PDE, the MSTW2008 [20] central value is 
also checked. When the difference between GTEQ and 
MSTW2008 is smaller than the uncertainty from varia¬ 
tions on the 20 eigenvectors, no additional uncertainty is 
taken. When the difference is larger, that difference is 
added in quadrature to the eigenvector uncertainty. Un- 
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FIG. 3: Distribution of mass difference between the and candidates in Z and W events, with fit results overlaid. The 
electron and muon decay modes are combined. 


TABLE II: Ratio of cross sections aiV + D*^)/aiV) for the 
inclusive sample > 3 GeV/c, and predictions of 

PYTHIA 6.2.16 simulation using the GTEQ5L PDE. Eor re¬ 
sults from data, the first uncertainty is statistical, the second 
is systematic. Eor results from simulation, uncertainty listed 
is statistical. 




a{V + D*+)/a 

■{v) (in %) 

Sample 

Data 

PYTHIA 6.2.16 

IF 

^ eiy 

1.74 ±0.21 ±0.17 


IF 

— y flV 

1.75 ±0.17 ±0.05 

1.77 ±0.07 

IF 

combined 

1.75 ±0.13 ±0.09 



^ ee 

1.0 ±0.6 ±0.2 



^ MM 

1.8 ±0.5 ±0.2 

1.36 ±0.05 


combined 

1.5 ±0.4 ±0.2 



certainty in as{Mz) at 90% C.L. is taken as an additional 
systematic uncertainty, and added in quadrature to the 
above. 

For both the W and Z samples, we find the ra¬ 
tio a{V + I)*+)/cr(R) for the inclusive set of events 
> 3 GeV/c. For the W samples only, we un¬ 
fold /^+ differentially as a function ofpT(7^*^)- Results 
from the electron and muon decay channels are combined 
using a best-linear-uncertainty estimator, assuming that 
systematic uncertainties are fully correlated across W or 
Z decay modes. The final results are shown in Table [HI 
and Fig. and in all cases agree with the pythia pre¬ 
dictions within uncertainties. 

The LF + signal events are expected to come from 
three different production processes: s{d) g ^ W c 
( Wc), q^q' cc) ( Wcc), and q^q' W^g{-^ 

bb) (Wbb). Due primarily to a difference in 
spectra, a neural network may generally identify 
from different sources (IFcc, IFc, and Wbb) with different 
efficiencies. This enables the fraction of signal from each 
production process to be determined. 

We train two tiers of neural networks; the first con¬ 


sists of networks that are trained to identify one type of 
signal (fFcc, Wc, and Wbb) versus the background 
sample described earlier; the second consists of networks 
that are trained to identify one type of signal over an¬ 
other {Wcc vs Wbb, Wc vs Wcc, and Wbb vs Wc). Each 
first-tier neural network is paired with a second-tier neu¬ 
ral network, in order to select candidates from one 
production source preferentially, over all other can¬ 
didates as described in chapter 10 of Ref. [18]. We apply 
each ANN-pair to simulated signal events to determine 
the efficiencies with which each pair identifies from 
each production source, obtaining a matrix of efficiencies. 
We finally apply each ANN-pair to the data and deter¬ 
mine three yields. We solve the resulting linear equations 
to find fractions xWcc^ j^wbb^ where is 

the fraction of LF + signal that comes from produc¬ 
tion process Y. After repeating the technique by varying 
the efficiencies for each process, we determine that the 
uncertaintly is dominantly statistical. 

This method gives good separation for X^^^, but is 
not very effective at determining the fraction X^^ ow¬ 
ing to similarities in the kinematic properties of from 
LFcc, and from LFc. In the case of IFc production, con¬ 
servation of charge requires that the W and are 

produced with opposite signs. In IFcc or Wbb produc¬ 
tion, however, we are equally likely to identify oppositely 
charged W and {W + D*+)os5 or same-signed W 

and D*+, (LF + D*+)ss- We therefore determine the 
number of IFc events in the W + signal by taking 
the difference (LF + D*+)os ~ (kF + D*+)ss- We divide 
by the sum (LF -f D*+)os + (W H-D*+)ss to get the frac¬ 
tion X^^. The uncertainty in this measurement of X^^ 
is statistical only and treated as uncorrelated with the 
uncertainty in the measurement of X^^^, due to the use 
of a different technique for finding each fraction. Nor¬ 
malized to unity, the production-process fractions in the 
overall W signal are obtained as X^^^ = 73 ±8%, 
XWbb = 13 ± 5 %^ = 14 ± 6%. The relative con- 
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FIG. 4: Ratio of cross sections a(W + D*~^)/a{W) as a func¬ 
tion of for combined W ^ eu and W ^ iii> re¬ 

sults. Error bars show the statistical uncertainty; the sum in 
quadrature of the statistical and systematic errors is shown as 
a yellow error band. The dotted red line shows the prediction 
of PYTHIA 6.2 obtained using the CTEQ5L PDE, with solid 
red lines showing PDE uncertainty in this prediction. The ra¬ 
tio of the simulated distribution to data is shown in the lower 
plot. 

tribution of these processes is momentum dependent. In 
particular, our ability to identify low-momentum 
candidates enhances the Wcc contribution compared to 
analyses performed at higher momenta m- 

In conclusion, we present the first measurement of 
V + production at hadron colliders in the regime 
> 3 GeV/c. The expected rate of produc¬ 
tion in V events is as predicted by pythia 6.2, both for 
the integrated sample > 3 GeV/c, and differen¬ 

tially as a function of 
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